PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A06G1734
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 409aa    MW: 46135.9 Da    PI: 6.1179
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A06G1734genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix83.13.5e-26292376186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  rW++ ev+aLi +r+ +e+++  + +k   W+e+s+ m   g++rs+k+Ckekwen+nk+++k   + kk+  e+s++c+yf++l+
  Gh_A06G1734 292 RWPDAEVQALIMLRSTLEHKFHVTGSKCSIWDEISAGMYNMGYSRSAKKCKEKWENINKYFRKSMGSGKKH-HENSKRCAYFHDLD 376
                  8******************************************************************9998.78888*******97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500906.957285349IPR017877Myb-like domain
CDDcd122034.11E-25291355No hitNo description
PfamPF138372.8E-18291377No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 409 aa     Download sequence    Send to blast
MELFTGGREA FPQHVAPFPD LTAIIEPVDD LMMSDHRPTL PPRKLRPIRY NGRSPASSQA  60
EDPSEFAEAV ELVGDEVCAI NGSSFDYMTP PIKAEVGDVT ATVGGRGSGV EGPPSSEQRG  120
EPSGSSSSDS DDDLSATGNE PLKKRKRKSR KKIQLFLEKL VMKVMDKQEQ MHKQLMEMIE  180
KREKERLIRE EAWKRQEMER VKRDEEARAQ EMSRSIALIS FIQNALGHEI EIPISTMSCM  240
EENGVKDASE DHIQKDTVNP FGPTNRWQEG TMQANGAENH EGGVSCDPNN RRWPDAEVQA  300
LIMLRSTLEH KFHVTGSKCS IWDEISAGMY NMGYSRSAKK CKEKWENINK YFRKSMGSGK  360
KHHENSKRCA YFHDLDVLYK NGFGNPVNHI NCIKVDNMDN GESLKGNED
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1142147KKRKRK
2142150KKRKRKSRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00552DAPTransfer from AT5G47660Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012449412.10.0PREDICTED: trihelix transcription factor GT-2 isoform X1
TrEMBLA0A0B0N5180.0A0A0B0N518_G
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM94112736
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G47660.11e-42Trihelix family protein